11 May, 2020

h2.title { font-size: 8px; #color: #a9a9a9; text-align: center; }

Introduction

Data set:

  • breast cancer

  • proteomics by mass spectrometry

  • four cancer classes:

Goal:

  • Explore the data to identify patterns

  • Create models to predict breast cancer class

Material and Methods

Material and Methods

Material and Methods

Material and Methods

Results — no outliers on total protein expression

Results — breast cancer classes in the dataset are well represented

Results — breast cancer classes do not discriminate on age

Results — breast cancer and gender

Results — protein expresion heatmap

Results — dimentionality reduction

Results — K-means clustering

Results — ANN model’s structure

Results — ANN performance

Discussion

  • What could have been better

  • further work

The end